Instructions
- This is a team assignment.
- You need to write a report with answers to each of the questions.
- Turn in the
html output file, and also the Rmd file.
- Total points for the assignment is 20. Five points of the score from the assignment will be given by another team, who will give you full marks if they can compile your report, and get the same answers as you, and find your explanations of the plots understandable and informative. Five points will be for individual effort. And the remaining 10 points will be the final group report.
Exercise
Melbourne property prices have taken their biggest hit since 2012, falling by almost 2 per cent in the past three months Jim Malo, Jul 26 2018, Domain
This assignment explores the data provided on Melbourne house prices by Anthony Pino. The goal is to examine whether housing prices have cooled in Melbourne, and help Anthony decide whether it is time to buy a two bedroom apartment in Northcote.
Your tasks
- Make a map of Melbourne showing the locations of the properties.

- Filter the data to focus only on the records for Northcote units. Make a plot of Price by Date, facetted by number of bedrooms. The main thing to learn from this plot is that there are many missing values for number of bedrooms. Impute the missing values, based on the regression method (covered in class). Make sure your predicted value is an integer. Re-make the plot of Price by Date, facetted by number of bedrooms. Make a description of what you learn from the plot, particularly about the trend of 2 bedroom unit prices in Northcote.


Using the many models approach, fit a linear model for Price on Date, separately for every suburb, for 2 bedroom units. You will need to impute the Bedroom2 variable, in the same way done in the previous question. Collect the model estimates, and also the model fit statistics. What suburb has had the largest increase, which has had the biggest decrease in prices? Which suburbs have the worst fitting models?
Examine the results of the auctions, with the Method variable, across suburbs. Perhaps compute proportion of PI, NB, VB, W, relative to the rest of the categories, by month over the years. The goal here is to determine if the downturn might be indicated by a higher proportion of unsold properties.
Fit a big model!
Grading
Points for the assignment will be based on:
- Accuracy of your answers to the given questions.
- How well you have conducted the analysis, and written your answers.
- Whether the Rmd file, can take data file as provided, produce the tidy data, and plots reported in your final submission.
- How well you have worked with your group members on completing this work. Your score for the group component will be adjusted by the amount of effort your team mates independently and anonymously report.